A global approach for contig construction

Authors

  • A. Gleizes
  • Alain Hénaut
Abstract

A program that assembles sequences using a global approach has been developed. In successive steps, an increasingly precise classification of the DNA fragments allows the sequences to be positioned on the contig. After pairs of overlapping sequences have been detected, groups are formed such that all sequences in a group overlap. Sequences common to several groups enable the groups to be ordered in a series. Ambiguities in the order of groups can arise at this stage because of repeated fragments; in that case, different solutions are proposed. Ordering the groups yields a preclassification of the sequences. The fragments are then aligned group by group, by searching for words common to all sequences in the group and applying a dynamic programming algorithm. A detailed example on a set of nine sequences accompanies the description of the method.
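
The grouping and ordering steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' program: the abstract does not say how groups are computed, so the sketch assumes that a "group" is a maximal set of mutually overlapping fragments and finds such sets with the Bron-Kerbosch clique enumeration, then chains groups greedily by the fragments they share. The function names `maximal_groups` and `order_groups`, the fragment labels, and the overlap pairs are all hypothetical, and overlap detection itself is taken as given.

```python
def maximal_groups(pairs):
    """Return maximal sets of fragments in which every pair overlaps.

    Assumption: a 'group' is a maximal clique of the overlap graph,
    enumerated here with Bron-Kerbosch (no pivoting).
    """
    adj = {}
    for a, b in pairs:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    groups = []

    def expand(current, candidates, excluded):
        if not candidates and not excluded:
            groups.append(frozenset(current))
            return
        for v in sorted(candidates):
            expand(current | {v}, candidates & adj[v], excluded & adj[v])
            candidates = candidates - {v}
            excluded = excluded | {v}

    expand(set(), set(adj), set())
    return groups


def order_groups(groups):
    """Chain groups so that consecutive groups share at least one fragment.

    Greedy heuristic (an assumption, not the published method): try each
    group as a starting point and always step to the unvisited group that
    shares the most fragments with the current one. Returns None when no
    starting point yields a complete chain.
    """
    groups = sorted(groups, key=sorted)  # deterministic iteration order
    for start in groups:
        chain = [start]
        rest = [g for g in groups if g != start]
        while rest:
            nxt = max(rest, key=lambda g: len(g & chain[-1]))
            if not (nxt & chain[-1]):
                break  # dead end: nothing left shares a fragment
            chain.append(nxt)
            rest.remove(nxt)
        if not rest:
            return chain
    return None


# Hypothetical overlap pairs for five fragments laid out left to right.
pairs = [("f1", "f2"), ("f1", "f3"), ("f2", "f3"), ("f3", "f4"), ("f4", "f5")]
groups = maximal_groups(pairs)
chain = order_groups(groups)
print([sorted(g) for g in chain])
```

A greedy chain returns a single ordering, so it does not report the alternative solutions that the abstract says are proposed when repeated fragments make the order ambiguous; a fuller treatment would enumerate every consistent chain.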

Related articles

An Integrated Approach to Comparative Assembly

We describe a novel approach to comparative assembly that directly integrates anchoring alignments into the contig assembly process, enabling the extension of contig construction through the boundaries of repeat nodes in a compressed de Bruijn graph. Our method exploits anchoring alignments, paired-read constraints and read threading as path selection heuristics while an assembly graph is trans...

Parallel construction of orthologous sequence-ready clone contig maps in multiple species.

Comparison is a fundamental tool for analyzing DNA sequence. Interspecies sequence comparison is particularly powerful for inferring genome function and is based on the simple premise that conserved sequences are likely to be important. Thus, the comparison of a genomic sequence with its orthologous counterpart from another species is increasingly becoming an integral component of genome analys...

Construction of a YAC contig covering human chromosome 6p22.

A contig covering human chromosome 6p22 that consists of 134 YAC clones aligned based on the presence/absence of 52 DNA markers is presented. This contig overlaps with the 6p23 contig at its telomeric end and with the 6p21.3 contig at its centromeric end. The order of loci within the contig resolves the relative positions of several genetically mapped markers. Among the additional markers used ...

An occupational risk assessment approach for construction and operation period of wind turbines

As wind energy is one of the most important renewable energy sources over the globe, need for increasing safety for this type of energy is gaining importance. Although this sector is not suffering an excessive amount of fatal injury accidents, there are many aspects open for improvements in occupational health and safety management. The construction and operation processes of wind turbines incl...

Estimating Sequence Similarity from Contig Sets

A key task in computational biology is to determine mutual similarity of two genomic sequences. Current bio-technologies are usually not able to determine the full sequential content of a genome from biological material, and rather produce a set of large substrings (contigs) whose order and relative mutual positions within the genome are unknown. Here we design a function estimating the sequent...

Journal title:
  • Computer applications in the biosciences : CABIOS

Volume 10, Issue 4

Pages: -

Publication date: 1994